Web pages search engine based on DNS
نویسندگان
چکیده
Search engine is main access to the largest information source in this world, Internet. Now Internet is changing every aspect of our life. Information retrieval service may be its most important services. But for common user, internet search service is still far from our expectation, too many unrelated search results, old information, etc. To solve these problems, a new system, search engine based on DNS is proposed. The original idea, detailed content and implementation of this system all are introduced in this paper. 1 Introduction When designing a search engine we may meet two main bottleneck problems. Because the WWW is a large distributed and dynamic world, search engine can't continue to index close to the entire Web as it grows and changes. So we will meet some serious problems in coverage and recency. According to the statistical data in 1998[1], the update interval of most pages database is almost one month and no a search engine can cover more than 50 percentage pages on Internet. Till now, these data is still available. Although there was no any obvious improvement, there were surely some effective efforts for a better web pages search engine. Harvest [2] is a representative distributed information retrieval system. Based on Harvest, Cooperate Search Engine (CSE) [3] is developed. These two methods require each web site indexing their web documents and provide interface for search engines. These approaches will reduce the update interval and network traffic, but none of them is widely applied. This is mainly because not all administrators of sites will agree to index their pages for search engines. Reference [4] gave a practical idea to build a distributed search engine. In this paper, Author advised to share the databases of the search engine and introduced a layered architecture to improve the access to data on the Internet. But in this paper, the author didn't give an applied method on how to implement his idea. We develop our new search engine on the foundation of previous works. 2 The update of DNS The original idea of new system could be found in the history of WWW. When the DNS comes into being, there are only hundreds of web sites, so we can put DNS table in single server. When the number of sites reached level of million and scattered in different place, several DNS can't work efficiently. So DNS developed into a distributed hierarchical system. …
منابع مشابه
A New Hybrid Method for Web Pages Ranking in Search Engines
There are many algorithms for optimizing the search engine results, ranking takes place according to one or more parameters such as; Backward Links, Forward Links, Content, click through rate and etc. The quality and performance of these algorithms depend on the listed parameters. The ranking is one of the most important components of the search engine that represents the degree of the vitality...
متن کاملWeb search engine based on DNS
Wang Liang, Guo Yi-Ping, Fang Ming 1, 3 (Department of Control Science and Control Engineering, Huazhong University of Science and Technology, WuHan, 430074 P.R.China) (Library of Huazhong University of Science and Technology, WuHan 430074 P.R.China) Abstract Now no web search engine can cover more than 60 percent of all the pages on Internet. The update interval of most pages database is almos...
متن کاملWeb pages ranking algorithm based on reinforcement learning and user feedback
The main challenge of a search engine is ranking web documents to provide the best response to a user`s query. Despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...
متن کاملAn Ensemble Click Model for Web Document Ranking
Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...
متن کاملمدل جدیدی برای جستجوی عبارت بر اساس کمینه جابهجایی وزندار
Finding high-quality web pages is one of the most important tasks of search engines. The relevance between the documents found and the query searched depends on the user observation and increases the complexity of ranking algorithms. The other issue is that users often explore just the first 10 to 20 results while millions of pages related to a query may exist. So search engines have to use sui...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره cs.NI/0403035 شماره
صفحات -
تاریخ انتشار 2004